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Using the observed proportionality of CCSD(T) and MP2 correlation interaction energies [I. 
Grabowski, E. Fabiano, F. Della Sala, Phys. Chem. Chem. Phys. 15, 15485 (2013)] we propose a 
simple scaling procedure to compute accurate interaction energies of non-covalent complexes. Our 
method makes use of MP2 and CCSD(T) correlation energies, computed in relatively small basis 
sets, and fitted scaling coefficients to yield interaction energies of almost complete basis set limit 
CCSD(T) quality. Thanks to the good transferability of the scaling coefficients involved in the 
calculations, good results can be easily obtained for different intermolecular distances. 


I. INTRODUCTION 

Non-covalent interactions play a fundamental role in 
chemistry and chemical-physics because of their impor¬ 
tance in many phenomena including, among others, bio¬ 
chemistry, solvation, supermolecular organization, and 
molecular recognition. For this reason they have been 
subject of intense research^—. However, because the 
strength of non-covalent interactions is one or two order 
of magnitude smaller than that of typical covalent inter¬ 
actions, ranging from few tens to few tenths of kcal/mol, 
special care is required for accurate quantitative studies. 

From a computational point of view this implies the 
need for high-level correlated wave function methods, 
being capable to describe correlation effects with good 
accuracy^. Additional computational issues, such as the 
basis set superposition and the basis set incompleteness 
errors, must be properly taken into account as well^ 1 . 
Finally, it must be considered that, because of the rather 
weak nature of non-covalent bonds, the description of 
non-covalent complexes cannot be restricted to the equi¬ 
librium geometry, but, in many cases, must be extended 
to include a more or less extended portion of the potential 
energy surface (PES). 

Actually the method of choice for an accurate compu¬ 
tational description of non-covalent complexes is the cou¬ 
pled cluster single and double with perturbative triple 
(CCSD(T)) approach^— with a complete basis set 
(CBS) description^. This method provides in fact inter¬ 
action energies with subchemical accuracy and yields er¬ 
rors often below 1 kcal/mol^. Nevertheless, the CCSD(T) 
method is computationally very demanding, scaling as 
0(N 7 ), thus its applicability is limited to small com¬ 
plexes. On the other hand, lower level approaches, 
such as Mpller-Plesset second-order perturbation theory 
(MP2)^ or even CCSD(T) calculations with a relatively 
small basis set, fail to describe various types of non- 
covalent interactions with the necessary accuracy. There¬ 


fore, continuous effort is dedicated to the development of 
new methods able of providing reliable results for non- 
covalent complexes at a reduced computational cost. 

Recently one step in this direction has been performed 
by proposing a simple non-empirical scaling procedure 
for MP2 interaction energies which allows to obtain a 
full dissociation curve of CCSD(T) quality at the cost of 
a single CCSD(T) calculation^. This method is based 
on an observed proportionality of the MP2 and CCSD(T) 
interaction energies which is maintained, with almost a 
constant factor, over a wide range on inter-molecular dis¬ 
tances. Thus, accurate dissociation curves can be com¬ 
puted, with a precision of few hundreds of mHa, for many 
non-covalent complexes with a reduced effort. However, 
for high accuracy the proposed scaling procedure still re¬ 
quires one CCSD(T)/CBS calculation, which may be out 
of reach for larger scale applications. 

In this paper we address this latest issue and propose a 
new methodology which joins the above mentioned scal¬ 
ing procedure with a basis set extrapolation scheme, so 
that finally highly accurate interaction energies can be 
computed by the sole requirement of relatively low-level 
calculations (e.g. MP2 and/or CCSD(T) with moderate 
basis sets). In this way a practical and efficient tool for 
large-scale application can be obtained. 


II. METHOD 

The main quantity of interest in this work is the cor¬ 
relation interaction energy. For calculations carried out 
with method X and using the basis set aug-cc-pVnZ it 
is denoted £ ^ an d defined as the difference of the cor¬ 
relation energies ( E calculated using method X and 
basis set aug-cc-pVnZ, with n = 2, 3,4, 5 i.e. D,T,Q,5) 
for a complex AB and its constituent fragments A and 


2 


B , i.e. 


4 n) = 45 [^i - 45 m - 45 ■ w 


According to Ref* 17 the CCSD(T) correlation inter¬ 
action energy of a non-covalent complex at any inter- 
molecular separation R. is related to the MP2 correlation 
interaction energy by the simple equation 

4csd ( t)( fl)«c (n) 45 2 (/2) , (2) 


where the factor cis evaluated as 
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with R being any reference point (for example, but not 
necessarily, the equilibrium inter-molecular separation 
R 0 ). Equations © and © allow to compute a full PES 
(with N points) of CCSD(T) quality at the cost of a sin¬ 
gle CCSD(T) calculation (at distance R) and several (i.e. 
N) MP2 calculations. 

When the CBS limit is considered, which formally 
corresponds to n = oo, the scaling factor of Eq. © 

shall be computed as c^°°^ = 4 csd(t)/ 4 ip 2 > where both 
4csd(t) an< ^ 44 have to be obtained via appropriate 
extrapolation formulas (e.g. Refsj 26 ' 40 ). However, the 
bottleneck of this computational approach is certainly 
the calculation of 4csd(t)- To avoid this we consider 

an alternative way to compute c^°°\ To this end we as¬ 
sume that the basis set evolution of the proportionality 
factor is described by the ansatz 

c {n) = c (oo) + An -a t (4) 


where A and a are assumed to be two system-dependent 
constants (note that they do not depend on n). A ra¬ 
tionalization for this ansatz is given in 0 A graphical 
impression of its accuracy for some typical non-covalent 
complexes is presented in Fig. |T| This figure shows in 
fact, for some selected complexes, the evolution of the 
values of the scaling factors c ^ with the basis set as 
well as the fit obtained via Eq. ©■ We see that the pos¬ 
tulated power behavior reproduces very well the trend 
of the computed scaling factors from l/n = 0.5 (aug-cc- 
pVDZ) to l/n = 0 (CBS limit). 

In practice, we apply Eq. © for two given values of n 
(we use n = 2 and n = 3) so that the CBS proportionality 
factor is given by the simple two-point formula 


r (°°) 

approx 



(5) 


where the only unknown parameter, so far, is a. Equa¬ 
tion © allows to obtain an estimate for c^°°^ avoiding 
the computationally expensive calculation of 4 csd(t)> 
requiring at most the evaluation of 4 csd(t)’ which is 



FIG. 1: Scaling factors cas functions of l/n (where n de¬ 
fines the basis sets aug-cc-pVnZ) for some non-covalent com¬ 
plexes. The dashed line shows the fit according to Eq. a. 


necessary to obtain cD) from Eq. © . On the other hand, 
the value of a should be, in principle system dependent. 
However, following Ref3l, we will fix it by fitting to some 
appropriate training set (see later on). Hence, Eq. © 
becomes a universal formula for obtaining ci^-ox- 

Using Eq. © it is possible to obtain a good approx¬ 
imation of 4 csd(t) h 1 a very efficient way via Eq. ©. 
Nevertheless, two possible variants can be considered: 


• In a first approach, that we denote Ml, we will use 
the best estimate available for 4 iP 2 , practice, a 
cubic extrapolation formula-— using aug-cc-pVQZ 
and aug-cc-pV5Z data (other extrapolation formu¬ 
las will not be considered here, since they will yield 
only slightly different results). Hence, the Ml cor¬ 
relation interaction energy is computed as 


, , 53 _ p( 4 ) 43 

C _ „(°°) °MP 2 ° °MP 2 ^ 

OM 1 — C M1 ^3 _ ^3 
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where c^f* is given by Eq. © with a fitted on a 
training set to reproduce reference values of the c 
coefficient at the reference CBS limit. Within this 
approach, to compute a full PES of N points with 
CCSD(T)/CBS quality, the most computationally 
intensive steps required are one CCSD(T)/aug-cc- 
pVTZ calculation and N MP2 calculations with 
large basis set (aug-cc-pV5Z in our case), one for 
each point of the PES. 


• A cheaper method, which will be denoted M2, can 
be obtained using a simpler formula to evaluate 
£mp 2 - Doing this by means of the approach of 
Ref. 2 - we obtain 


£m 2 — c, 
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TABLE I: Computed and extrapolated values of the propor¬ 
tionality factors c (7l) for the systems of the training set. 


System 

c< 2 > 

c< 3 > 

c (oo) 

r (°°) 

C M 1 

J°°) 

C M 2 

He 2 

1.262 

1.261 

1.248 

1.261 

1.261 

Ne 2 

1.185 

1.236 

1.273 

1.274 

1.253 

He-Ne 

1.229 

1.266 

1.274 

1.294 

1.279 

Ar 2 

0.860 

0.905 

0.913 

0.939 

0.921 

(H 2 0) 2 

1.048 

1.031 

0.937 

1.019 

1.026 

(H 2 S) 2 

0.834 

0.876 

0.922 

0.907 

0.890 

(HF) 2 

1.184 

1.088 

0.924 

1.017 

1.056 

(HC1) 2 

0.791 

0.846 

0.853 

0.887 

0.865 


TABLE II: Extrapolated correlation interaction energies 
(kcal/mol) for the systems of the training set computed with 
different methods. Reference values are CCSD(T)/CBS re¬ 
sults. The last line reports the mean absolute error (MAE) 
and the mean absolute relative error (MARE). 


System 

MP2-23 

CCSD(T)-23 

Ml 

M2 

Ref 

He 2 

-0.03 

-0.04 

-0.04 

-0.04 

-0.04 

Ne 2 

-0.09 

-0.11 

-0.11 

-0.11 

-0.11 

He-Ne 

-0.06 

-0.08 

-0.08 

-0.08 

-0.08 

Ar 2 

-0.67 

-0.61 

-0.63 

-0.61 

-0.61 

(H 2 0) 2 

-1.36 

-1.42 

-1.39 

-1.38 

-1.27 

(H 2 S) 2 

-2.21 

-2.04 

-2.00 

-2.04 

-2.04 

(HF) 2 

-0.92 

-0.86 

-0.93 

-0.87 

-0.85 

(HC1) 2 

-2.22 

-2.01 

-1.97 

-2.01 

-1.90 

MAE 

0.09 

0.04 

0.04 

0.03 


MARE 

14.1% 

3.7% 

3.6% 

3.4% 



where the value of the coefficient 7 = 1.91 is taken 
from ReL2& and c ^ 2 is given again by Eq. ([5]) with 
a proper value of a (see below). In this case only 
<?MP 2 calculations instead of £^p 2 are required at 
most. Thus, the global computational cost of the 
whole procedure is practically reduced to the cal¬ 
culation of one ^ccsd(t) (t° compute in Eq. 
d5])) . Of course, in this case to obtain high accu¬ 
racy it is not possible to use the same a value as 
for the Ml method (hence, we used the subscripts 
Ml and M2 to denote the scaling factors in the two 
methods). On the contrary, it must be determined 
by fitting to the CBS values of CCSD(T) reference 
correlation interaction energies, and not CBS limit 
c values, in order to effectively take into account 
also the possible inaccuracies on the extrapolated 
MP2 energy. 


III. COMPUTATIONAL DETAILS 

All calculations have been performed using the PSI4 
codeia and aug-cc-pVnZ basis sets ^- 2 with n = D, T, 
Q, and 5 (i.e. =2, 3, 4, and 5). In all cases frozen core and 
resolution of identity^ (for both Hartree-Fock and corre¬ 
lated calculations) approximations have been employed. 
All results include counterpoise correction^. In all calcu¬ 
lations, unless otherwise stated, the extrapolation to the 
CBS limit has been performed using a two-point cubic 
formula based on aug-cc-pVQZ and aug-cc-pV5Z data&2£ 
(the corresponding results are indicated with the label 
method/CBS. On the other hand, extrapolated results 
denoted MP2-23 and CCSD(T)-23 have been obtained 
using the method and parameters reported in Ref^. A 
similar notation is used for the scaling coefficients c ^ 
(see Eq. J3])). 

For the parametrization and testing of the methods 
defined in this work we considered various small non- 
covalent complexes having different interaction charac¬ 
ters. These include He 2 , Ne 2 , He-Ne, Ar 2 , Ne-Ar, CH 4 - 
Ne (dispersion interaction; DI), (H 2 0 ) 2 , (HF) 2 , NH 3 - 
H 2 0, (NH 3 ) 2 (hydrogen bond; HB), (H 2 S) 2 , (HC1) 2 , 
H 2 S-HC1 (dipole-dipole interaction; DD), HCN-C1F, 
NH 3 -F 2 (charge-transfer interaction; CT), and LiH-HF 
(dihydrogen interaction; DH). The geometries of the 
complexes were taken from Refs. 1 '’ 2 ' -3 !:. Finally, in ad¬ 
dition, we considered the systems from the S22x5 test 
seiM. 

As reference benchmark data CCSD(T)/CBS results 
obtained extrapolating from aug-cc-pVQZ and aug-cc- 
pV5Z calculations or accurate CCSD(T)/CBS results 
from literature (when indicated). These accurate en¬ 
ergies are employed to assess the accuracy of all the 
methods based on -23 extrapolation, i.e. our own meth¬ 
ods as well as MP2-23 and CCSD(T)-23 calculations. 
Moreover, for completeness, we have considered also 
SCS(MI)-MP2/CBS2£ and MP2.5/CBS^ interaction en¬ 
ergies. These methods were in fact developed to “repro¬ 
duce” CCSD(T) results at a cost close to MP2 calcula¬ 
tions. Thus, it is in the same spirit as the ones developed 
in the present work. 


A. Parametrization 

To perform the fit required to fix the parameter a in 
both the Ml and M2 methods, we consider a training set 
composed of the following non-covalent complexes: He 2 , 
Ne 2 , He-Ne, Ar 2 (dispersion complexes), (H 2 0 ) 2 , (HF ) 2 
(hydrogen bond complexes), and (H 2 S) 2 , (HC 1)2 (dipole- 
dipole complexes), at equilibrium geometry. The values 
of c, c, and c^°°^ for the various systems are reported 
in Table HI Note that in Table [Q denotes the “true” 
CBS limit value of the scaling factor, computed as the 
ratio of the extrapolated CCSD(T)/CBS and MP2/CBS 
energies. 

Considering method Ml, we need to fit Eq. J5]) to the 
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c*- 00 ) values. Thus, we minimized the target function 

f7M i(a) = ^( C ^ ) (a)H-c (oo) [ ? ]) 2 , (8) 

i 

where the sum runs over all the systems in the training 
set and the notation <4^(a)[?] indicates that we con¬ 
sider the value relative to system i and computed 
at a given value of a. Doing so we obtained a = 2.1 and 
the values reported in Table HI The fitted coefficients 
show a mean absolute error (MAE) of 0.035 and a mean 
absolute relative error (MARE) of 3.7% with respect to 
the c(°°) ones. The corresponding correlation interaction 
energies, obtained using the Ml approach, are listed in 
Table El They display a good agreement with the refer¬ 
ence CCSD(T)/CBS energies having a MAE of 0.06 mHa 
and a MARE of 3.6%, being similar to CCSD(T)-23 ex¬ 
trapolated energies. 

To fit for the M2 method we consider instead, 
as discussed before, as target values the reference 
CCSD(T)/CBS correlation interaction energies of Table 
El hence we minimize 

a M 2 (ck) = 'y ' ^M2(o:)[f] — ^CCSD(T)/CBsI*]) > (®) 

i 

where Sm 2 (o)[{\ depends on a through ■ This yields 
a = 3.4 and the scaling factors displayed in the last col¬ 
umn of Tab. |U When used in the M2 approach these 
yield the final M2 energies reported in Tab. El These en¬ 
ergies show by construction a very good agreement with 
reference values, having a MAE of only 0.05 mHa. 

IV. RESULTS 

To test the methods developed in the previous sections 
we applied them to the computation of the interaction en¬ 
ergies of several non-covalent complexes. For complete¬ 
ness we considered energies both at the equilibrium dis¬ 
tance Ro and at elongated distances equal to 1.2i?o and 
1.5i?o. The values of the correlation interaction energies, 
obtained from various methods, are reported in Table 
ED In addition Fig. [2] reports the absolute error on the 
correlation interaction energy as obtained from different 
methods for some complexes, taken as examples, includ¬ 
ing also shorter (i.e. 0.9i?o) and larger (i.e. up to 2.0i?o) 
distances. 

Inspection of the table and the figure shows that both 
the Ml and the M2 methods perform very well for 
the test set, being competitive with CCSD(T)-23 and 
MP2.5/CBS, at all distances, and superior to MP2-23 
and SCS(MI)-MP2/CBS. In more detail, the Ml method 
seems to perform slightly better than M2 for hydrogen- 
bond complexes at equilibrium distance, while M2 gives 
better results for dispersion complexes and in general 
at displaced geometries. These differences are however 
rather small and shall be considered with caution due to 



FIG. 2: Absolute error with respect to reference val¬ 
ues (CCSD(T)/CBS) of correlation interaction energies 
(kcal/mol) computed with different methods and at differ¬ 
ent intermolecular distances R (Ro denotes the equilibrium 
distance) for some non-covalent complexes. 


the small dimension of the test set. We recall that both 
methods have a computational cost comparable with 
CCSD(T)-23 when a single bond distance (e.g. R = Ro) 
is considered and smaller than it when several bond- 
distances are involved. In particular the M2 method 
as a computational cost almost comparable to MP2- 
23 (and SCS(MI)-MP2/CBS), but an accuracy close to 
CCSD(T)-23. Note however that some the methods con¬ 
sidered here are based on extrapolation form triple-zeta 
quality basis sets at most. Thus, it may happen that they 
display shortcoming for some systems where this level of 
basis set is not fully sufficient to describe the correla¬ 
tion effects. This appears to be the case, for example of 
the HCN-C1F complex which shows a quite larger rela¬ 
tive error than other complexes, especially at R = 1.5I?o, 
possibly because of the limitations of the aug-cc-pVTZ 
basis set to describe correlation in the C1F molecule. 

As a further and more extended test we consider in Fig. 
[3] (see also supporting information), for the methods re¬ 
quiring at most triple-zeta basis set calculations, the ab¬ 
solute errors with respect to accurate values^ of correla¬ 
tion interaction energies for the complexes of the S22 test 
set for Ro, 1.2 R 0 , and 1.5i?o- Accurate reference correla¬ 
tion interaction energies have been obtained by subtract¬ 
ing CBS Hartree-Fock values from the benchmark ener¬ 
gies of Refj-22. Statistics for these data are summarized 
in Table IIVI Finally, a comparison of total interaction 
energy errors with several literature result s 32 ’ 34 ! 35 is also 
reported in Fig. [I] The plots reported in the figure as 
well as the data of Table HVl confirm the results of Tables 
E! and ED showing that the M2 method is competitive 
with the CCSD(T)-23 approach, as well as with other 
high level methods (e.g. accurate MP2.5, MP2C, or SCS- 
CCSD) and can clearly improve over the MP2-23 and 
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TABLE III: Computed and extrapolated correlation interaction energies (kcal/mol) for the systems of the test set at different 
intermolecular distances (Ro denotes equilibrium distance). Reference values are CCSD(T)/CBS results. The label SCS(MI) 
stands for SCS(MI)-MP2/CBS. In the second column the interaction character for each complex is reported. The last line 
reports the mean absolute error (MAE) and the mean absolute relative error (MARE). 


System 

Interaction 

MP2-23 

SCS(MI) 

MP2.5/CBS 

CCSD(T)-23 

Ml 

M2 

Ref. 





O 

a? 

II 





HCN-C1F 

CT 

-3.90 

-2.72 

- 3.11 

-2.83 

-2.81 

-2.82 

-2.68 

NH 3 -F 2 

CT 

-2.33 

-1.56 

-2.08 

-2.08 

-2.10 

-2.09 

-2.09 

LiH-HF 

DH 

-3.60 

-2.65 

-2.85 

-3.06 

-2.97 

-3.03 

-2.87 

NH 3 -H 2 O 

HB 

-2.17 

-1.72 

-2.09 

-2.08 

-2.06 

-2.10 

-1.96 

(NH 3 ) 2 

HB 

-1.70 

-1.27 

-1.64 

-1.66 

-1.60 

-1.61 

-1.57 

HCI-H 2 S 

DD 

-3.27 

-2.45 

-2.97 

-2.67 

-2.59 

-2.68 

-2.55 

CH 4 -Ne 

DI 

-0.28 

-0.17 

-0.29 

-0.30 

-0.33 

-0.29 

-0.29 

Ne-Ar 

DI 

-0.29 

-0.08 

-0.29 

-0.24 

-0.24 

-0.24 

-0.24 

MAE 


0.41 

0.21 

0.14 

0.09 

0.06 

0.08 


MARE 


18.8% 

22.7% 

7.9% 

4.2% 

4.0% 

3.7% 






R = 1.2Ro 





HCN-C1F 

CT 

-1.12 

-1.40 

-0.84 

-0.64 

-1.07 

-0.81 

-0.90 

NH 3 -F 2 

CT 

-0.76 

-0.81 

-0.68 

-0.58 

-0.85 

-0.68 

-0.74 

LiH-HF 

DH 

-1.99 

-2.26 

-1.80 

-1.52 

-1.84 

-1.68 

-1.73 

NH 3 -H 2 O 

HB 

-1.02 

-1.10 

-0.98 

-0.91 

-1.09 

-0.99 

-0.93 

(NH 3 ) 2 

HB 

-0.78 

-0.78 

-0.76 

-0.68 

-0.83 

-0.77 

-0.70 

HCI-H 2 S 

DD 

-1.36 

-1.39 

-1.22 

-0.97 

-1.10 

-1.11 

-1.08 

CH 4 -Ne 

DI 

-0.07 

-0.14 

-0.07 

-0.06 

-0.10 

-0.08 

-0.09 

Ne-Ar 

DI 

-0.04 

-0.08 

-0.04 

-0.04 

-0.07 

-0.04 

-0.06 

MAE 


0.12 

0.21 

0.06 

0.11 

0.09 

0.05 


MARE 


16.2% 

30.7% 

11.3% 

19.1% 

14.6% 

9.1% 






R = L5.Ro 





HCN-C1F 

CT 

-0.34 

-0.25 

-0.26 

-0.05 

-0.22 

-0.17 

-0.11 

NH 3 -F 2 

CT 

-0.19 

-0.21 

0.17 

-0.16 

-0.21 

-0.18 

-0.21 

LiH-HF 

DH 

-0.71 

-1.04 

-0.64 

-0.51 

-0.72 

-0.60 

-0.65 

NH 3 -H 2 O 

HB 

-0.28 

-0.31 

-0.26 

-0.24 

-0.26 

-0.28 

-0.27 

(NH 3 ) 2 

HB 

-0.21 

-0.21 

-0.20 

-0.19 

-0.21 

-0.21 

-0.21 

HCI-H 2 S 

DD 

-0.35 

-0.37 

-0.30 

-0.23 

-0.31 

-0.29 

-0.31 

CH 4 -Ne 

DI 

-0.01 

-0.02 

-0.01 

-0.01 

-0.02 

-0.01 

-0.02 

Ne-Ar 

DI 

-0.01 

-0.01 

-0.01 

-0.02 

-0.01 

-0.01 

-0.02 

MAE 


0.04 

0.08 

0.03 

0.04 

0.03 

0.03 


MARE 


37.5% 

33.6% 

28.0% 

22.9% 

16.4% 

17.9% 



SCS(MI)-MP2/CBS results. We remark that this per¬ 
formance is obtained for all distances by requiring only a 
single CCSD(T)/aug-cc-pVTZ calculation. Moreover, al¬ 
though no clear systematic trend can be obtained for the 
accuracy of different methods versus the bonding charac¬ 
ter type, we remark that the observed behavior of M2 is 
rather consistent for all the complexes and distances with 
CCSD(T)-23, being even slightly superior for dispersion 
complexes. On the contrary, MP2-23 appears rather ac¬ 


curate only for hydrogen bond complexes. Finally, we 
remark that for d = Rq, where more accurate reference 
data2£ than those in Ref»22 are available, the results do 
not change substantially when the best reference values 
are used. In fact, in this latter case the MAEs (MAREs) 
of MP2-23, SCS(MI)-MP2/CBS, CCSD(T)-23, and M2 
are 1.13 (19.5%), 0.70 (14.5%), 0.28 (6.0%), and 0.31 
(6.7%), respectively, in good agreement with the values 
reported in Table hyi 
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TABLE IV: Overall statistics, including mean absolute errror (MAE) in kcal/mol and mean absolute relative error, for the 
performance of different methods on the S22 test set at various equilibrium distances. The label SCS(MI) stands for SCS(MI)- 
MP2/CBS. The best result for each line is highlighted in bold style. 


R/Ro 

Interaction 


MP2-23 

SCS(MI) 

CCSD(T)-23 

M2 

0.9 

H-bond 

MAE 

2.02 

0.79 

0.63 

0.62 



MARE 

13.1% 

20.1% 

9.0% 

9.0% 


Dispersion 

MAE 

0.71 

0.81 

0.31 

0.30 



MARE 

18.6% 

9.3% 

4.6% 

4.3% 


Mixed 

MAE 

2.25 

1.04 

0.39 

0.37 



MARE 

25.3% 

13.9% 

5.2% 

4.5% 


All 

MAE 

1.61 

0.88 

0.44 

0.42 



MARE 

19.0% 

14.2% 

6.2% 

5.9% 

1.0 

H-bond 

MAE 

1.29 

0.72 

0.21 

0.24 



MARE 

12.2% 

19.3% 

8 .0% 

11.5% 


Dispersion 

MAE 

0.45 

0.50 

0.22 

0.27 



MARE 

19.6% 

9.2% 

3.3% 

3.1% 


Mixed 

MAE 

1.72 

0.88 

0.24 

0.26 



MARE 

31.5% 

21.1% 

5.6% 

5.4% 


All 

MAE 

1.12 

0.69 

0.22 

0.26 



MARE 

21.1% 

16.2% 

5.5% 

6.5% 

1.2 

H-bond 

MAE 

0.56 

0.27 

0.14 

0.15 



MARE 

6.2% 

15.4% 

7.8% 

8.9% 


Dispersion 

MAE 

0.17 

0.28 

0.08 

0.08 



MARE 

21.9% 

11.9% 

5.6% 

5.5% 


Mixed 

MAE 

0.73 

0.41 

0.18 

0.16 



MARE 

32.4% 

26.5% 

8.6% 

7.8% 


All 

MAE 

0.47 

0.31 

0.13 

0.13 



MARE 

20.3% 

17.6% 

7.2% 

7.3% 

1.5 

H-bond 

MAE 

0.18 

0.10 

0.07 

0.07 



MARE 

34.5% 

35.4% 

14.3% 

9.0% 


Dispersion 

MAE 

0.07 

0.09 

0.05 

0.05 



MARE 

23.0% 

10.9% 

14.6% 

13.3% 


Mixed 

MAE 

0.25 

0.15 

0.09 

0.08 



MARE 

45.9% 

62.7% 

36.0% 

42.0% 


All 

MAE 

0.16 

0.11 

0.07 

0.06 



MARE 

33.9% 

35.2% 

21.3% 

21 .1% 


V. CONCLUSIONS 


We have proposed efficient scaling procedures to com¬ 
pute accurate correlation interaction energies of non- 
covalent complexes. Our methods are based on the ob¬ 
served proportionality between MP2 and CCSD(T) in¬ 
teraction energies as well as on simple basis set extrapo¬ 
lation formulas. In this way correlation interaction ener¬ 
gies of CCSD(T)/CBS quality can efficiently be obtained 
from few MP2 and CCSD(T) calculations using small ba¬ 


sis sets. If several bonding distances are of interest, as in 
PES scanning studies, even a lower computational cost 
is achieved by exploiting the (almost) constancy of the 
scaling factors with respect to inter-molecular distance. 
Thus, the present methods, and in particular the M2 one, 
represent promising tools for future studies of large non- 
covalent systems, e.g. in biochemistry. 

Nevertheless, future developments can be foreseen 
starting from the present results, especially to improve 
further the overall computational efficiency. In this sense, 
it will be particularly interesting to consider the ex- 
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FIG. 3: Absolute relative errors with respect to refer¬ 
ence values^ for the correlation interaction energies of non- 
covalent complexes of the S22x5 test set at different bond¬ 
ing distances d. The numbering of the complexes is: (1) 2- 
pyridoxine-2-aminopyridine, (2) adenine-thymine [WC], (3) 
ammonia dimer, (4) formamide dimer, (5) formic acid dimer, 
(6) water dimer, (7) uracil dimer [hb], (8) methane dimer, 
(9) ethene dimer, (10) benzene-methane, (11) benzene dimer 
[stack], (12) pyrazine dimer, (13) uracil dimer, (14) indole- 
benzene [stack], (15) adenine-thymine [stack], (16) benzene- 
ammonia, (17) benzene dimer [T shape], (18) benzene-water, 


£^ 0.5 

o 

1 

2 0-4 


!h 

o 

•- 

•- 

<u 

a i 

—■ 
a 
"o 

Vl 

J 2 

CS 

e 

05 

4i 


0.3 


0.2 


0.1 


0.0 


MP2-23 

SCS(MI)-MP2/CBS 

MP2C 

MP2.5 

SCS-CCSD 

CCSD(T)-23 

M2 



d=1.2R„ 


d=1.5R„ 


FIG. 4: Mean absolute errors (kcal/mol) with respect to accu¬ 
rate values^ for the total interaction energies of non-covalent 
complexes of the S22x5 test set at different bonding distances 
d as obtained from several theoretical approaches. Accurate 
MP2C, MP2.5, and SCS-CCSD results are taken from Ref.—. 


tension of the present methodology to scaled-opposite- 
spin (SOS) MP2 calculations— In fact, the SOS-MP2 
method can be implemented with a favorable <D(N 4 ) scal¬ 
ing (as opposed to the 0(N 5 ) of conventional MP2) and 
also displays a proportionality with accurate correlation 
result o 17 : 37- — . 
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Appendix A: Rationalization of Eq. (|4]) 


Following Refs—we can write 


c( n ) _ c(°°) I n„-/3 

^CCSD(T) — ^CCSD(T) ' 


c(n 

°MP 2 


Moo) 

C MP2 


+ Cn ~ 7 


(Al) 

(A2) 


where B, C, j3, and 7 are constants. Taking the ratio of 
the two equations we find 


c 


(n) 


c(n) Moo) o -0 

c CCSD(T) _ ^CCSD(T) 


An) 

°MP 2 


c ( 00) 
C MP2 


+ Cn-'y 


Moo) 

°CCSD(T) 


/£^ 2 + Bn ~ 0 


1 + Cn-'y 


(A3) 


with B = B/ 8^2 and C = C /£^ 2 (note these are 
constants with respect to n). Now observing that 


Cn^ 


Mn) 

°MP 2 


<7(00) 

6 MP2 



< 1 , 


(A4) 


we can write 


.(") 


c (oo) + Bn - 0 


1 - Cn^ + O 


(c 2 n~ 2 ^ 


c (o °) + Bn ~ 0 -c {oo) Cn - 1 . 


(AS) 


Finally, considering that /3 ~ 7 (see Refs . 26 : 40 ) we can 
dehne two new constants ~ 7 ~ a and A = B — c^°°^C 
and obtain 


c ( ") w c (oo) + An~ a . (A 6 ) 
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